Conversation
9c93e54 to
0303b90
Compare
0303b90 to
ba7c900
Compare
74bb269 to
cec99ef
Compare
fd971e0 to
7f43434
Compare
|
@matthewhoffman and @trhille, to test this for now, use: This branch is needed until I tag a 3.3.0rc2 for mache. |
|
@xylar , can you walk me through a few more details about the transition to First off, is the mache branch in your previous comment out of date? Mache branch It ran great for awhile and seemed much faster than the old Am I doing this wrong? Is this trying to deploy for the entire project? I don't think you want me interacting with |
|
@matthewhoffman, I'm sorry. I'm developing I needed to release mache 3.3.0 for Polaris yesterday. As a result, the update-to-3.3.0 branch is gone. But I neglected to update this Compass branch until just now. At this point, no You also don't want to deploy spack. That was a mistake in my command above. |
d3cc399 to
21d2713
Compare
|
Thanks, @xylar . I made a little more progress with the command you suggested. I had to make this change: but then I still ran into an issue of it trying to touch the deployed spack env in the e3sm project space: |
I think that's in 21d2713. Did you not have that commit or did I miss something? |
Yep, that's something I need to fix. Sorry about that! |
|
@matthewhoffman, the second issue should be fixed. |
|
@xylar , thanks for addressing the second issue. The first must have been because I had failed to update my local branch this morning. After updating to 160d75d , |
The Compass version is in there: It's just called something different than before. We can copy that into another environment variable if you need it. |
|
Oh, wait, it already is: |
|
Are you not seeing that in you load script? |
|
I just mean the name of the load script used to have the version in the filename, but I'm not seeing that. It's not a big deal, I was just wondering if that was intentional. As for progress, when I compile MALI I am seeing the same PIO lib errors that you do in the issue you opened. I'm working on debugging them with help from ChatGPT and so far the obvious things are not working, but I'll keep at it while I have time. |
|
I see. No, the load script won't include the compass version anymore. I didn't find that to be particularly useful. |
|
I found it useful to know how old my load script was when I revisited a workdir I hadn't visited recently, but I could easily get that information from opening the load script. I have only very, very rarely needed to use an old load script older, so having the load script clobbered when the version changes isn't really a concern. |
|
The way you things work now, you can't have more than one pixi environment in a given branch so it isn't really meaningful to have different versions of the load script for different versions. You should consider using different worktrees for different versions. |
This keeps ESMF from stepping on SCORPIO's toes by installing its own ParallelIO.
|
@matthewhoffman, I think the cleanest solution is the same as what Polaris does -- put ESMF in a software environment and SCORPIO in a library environment so they don't clobber each other. We only use ESMF as a binary so that works out. |
This is no longer needed now that ESMF is in the software environment.
|
I'm regenerating the spack environments. I'll test this out on both Perlmutter and Chrysalis (the latter before it's too late!). |
|
@matthewhoffman, I think the linking issue is likely fixed. I'm trying a rebuild of MALI now on Chrysalis. I'll try Perlmutter tomorrow. But looking good so far... |
|
The Chrysalis and Perlmutter-CPU builds with gnu worked fine (I didn't try to run the test suite). I'm building with Perlmutter-GPU, gnugpu now... |
|
@xylar , I was also able to build the conda env using the new deploy.py script on pm-cpu and successfully compile MALI at @mperego , can you look at Full compass results:full_integration resultsResults at |
|
Hi Matt,
Thanks for trying it out.
The test you pointed me to, fails for a floating point exception. It would be interesting to know if it would pass if you don't check for FPEs.
Anyway, I can try to reproduce it in the next days and see if we can fix it.
Mauro
…________________________________
From: Matt Hoffman ***@***.***>
Sent: Saturday, April 4, 2026 2:39 PM
To: MPAS-Dev/compass ***@***.***>
Cc: Perego, Mauro ***@***.***>; Mention ***@***.***>
Subject: [EXTERNAL] Re: [MPAS-Dev/compass] Update to v2.0.0-alpha.1 (PR #944)
[https://avatars.githubusercontent.com/u/4182034?s=20&v=4]matthewhoffman left a comment (MPAS-Dev/compass#944)<#944 (comment)>
@xylar<https://github.com/xylar> , I was also able to build the conda env using the new deploy.py script on pm-cpu and successfully compile MALI at develop. When I run our full_integration suite, however, I see a strange mix of pass and fail. I expected the baseline comparison for runs with Albany to fail, but I'm also seeing some execution fails. There's a lot to sift through there, so I'm just going to post the full results in a collapsed section for now. I'm seeing a some PIO errors killing the model, and I'm also seeing Albany strangely abort after the solver completes. I think I'll need to debug all these failures before moving things forward.
@mperego<https://github.com/mperego> , can you look at /pscratch/sd/h/hoffman2/COMPASS/TESTING/landice/dome/2000m/fo_decomposition_test/1proc_run/log.albany.0000.out and see if you can tell what happened to that run?
Full compass results:
full_integration results
Results at /pscratch/sd/h/hoffman2/COMPASS/TESTING
verifying deployed compass version...
Verified version 2.0.0-alpha.1.
loading compute pixi env...
pixi env loaded.
activating spack env...
Activating Modules:
1) cray-netcdf-hdf5parallel/4.9.2.1
spack env activated.
loading compass environment variables...
compass environment variables loaded.
landice/dome/2000m/sia_restart_test
* step: setup_mesh
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:24
landice/dome/2000m/sia_decomposition_test
* step: setup_mesh
* step: 1proc_run
* step: 4proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:06
landice/dome/variable_resolution/sia_restart_test
* step: setup_mesh
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:07
landice/dome/variable_resolution/sia_decomposition_test
* step: setup_mesh
* step: 1proc_run
* step: 4proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:05
landice/enthalpy_benchmark/A
* step: setup_mesh
* step: phase1
* step: phase2
* step: phase3
* step: visualize
test execution: SUCCESS
baseline comparison: PASS
test runtime: 00:23
landice/eismint2/decomposition_test
* step: setup_mesh
* step: 1proc_run
* step: 4proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:16
landice/eismint2/enthalpy_decomposition_test
* step: setup_mesh
* step: 1proc_run
* step: 4proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:18
landice/eismint2/restart_test
* step: setup_mesh
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:21
landice/eismint2/enthalpy_restart_test
* step: setup_mesh
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: PASS
test runtime: 00:17
landice/greenland/sia_restart_test
* step: full_run
* step: restart_run
Failed
test execution: ERROR
see: case_outputs/landice_greenland_sia_restart_test.log
test runtime: 00:10
landice/greenland/sia_decomposition_test
* step: 16proc_run
* step: 32proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_greenland_sia_decomposition_test.log
test runtime: 00:08
landice/hydro_radial/restart_test
* step: setup_mesh
* step: full_run
* step: visualize_full_run
* step: restart_run
* step: visualize_restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_hydro_radial_restart_test.log
test runtime: 00:41
landice/hydro_radial/decomposition_test
* step: setup_mesh
* step: 1proc_run
* step: visualize_1proc_run
* step: 3proc_run
* step: visualize_3proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_hydro_radial_decomposition_test.log
test runtime: 00:11
landice/humboldt/mesh-3km_decomposition_test/velo-none_calving-none_subglacialhydro
* step: 16proc_run
* step: 32proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_humboldt_mesh-3km_decomposition_test_velo-none_calving-none_subglacialhydro.log
test runtime: 00:18
landice/humboldt/mesh-3km_restart_test/velo-none_calving-none_subglacialhydro
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_humboldt_mesh-3km_restart_test_velo-none_calving-none_subglacialhydro.log
test runtime: 00:48
landice/dome/2000m/fo_decomposition_test
* step: setup_mesh
* step: 1proc_run
Failed
test execution: ERROR
see: case_outputs/landice_dome_2000m_fo_decomposition_test.log
test runtime: 00:11
landice/dome/2000m/fo_restart_test
* step: setup_mesh
* step: full_run
Failed
test execution: ERROR
see: case_outputs/landice_dome_2000m_fo_restart_test.log
test runtime: 00:08
landice/dome/variable_resolution/fo_decomposition_test
* step: setup_mesh
* step: 1proc_run
Failed
test execution: ERROR
see: case_outputs/landice_dome_variable_resolution_fo_decomposition_test.log
test runtime: 00:11
landice/dome/variable_resolution/fo_restart_test
* step: setup_mesh
* step: full_run
Failed
test execution: ERROR
see: case_outputs/landice_dome_variable_resolution_fo_restart_test.log
test runtime: 00:10
landice/circular_shelf/decomposition_test
* step: setup_mesh
* step: 1proc_run
* step: 4proc_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_circular_shelf_decomposition_test.log
test runtime: 00:20
landice/greenland/fo_decomposition_test
* step: 16proc_run
Failed
test execution: ERROR
see: case_outputs/landice_greenland_fo_decomposition_test.log
test runtime: 00:13
landice/greenland/fo_restart_test
* step: full_run
Failed
test execution: ERROR
see: case_outputs/landice_greenland_fo_restart_test.log
test runtime: 00:12
landice/thwaites/fo_decomposition_test
* step: 16proc_run
Failed
test execution: ERROR
see: case_outputs/landice_thwaites_fo_decomposition_test.log
test runtime: 00:14
landice/thwaites/fo_restart_test
* step: full_run
Failed
test execution: ERROR
see: case_outputs/landice_thwaites_fo_restart_test.log
test runtime: 00:12
landice/thwaites/fo-depthInt_decomposition_test
* step: 16proc_run
Failed
test execution: ERROR
see: case_outputs/landice_thwaites_fo-depthInt_decomposition_test.log
test runtime: 00:11
landice/thwaites/fo-depthInt_restart_test
* step: full_run
Failed
test execution: ERROR
see: case_outputs/landice_thwaites_fo-depthInt_restart_test.log
test runtime: 00:13
landice/humboldt/mesh-3km_restart_test/velo-fo_calving-von_mises_stress_damage-threshold_faceMelting
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_humboldt_mesh-3km_restart_test_velo-fo_calving-von_mises_stress_damage-threshold_faceMelting.log
test runtime: 00:49
landice/humboldt/mesh-3km_restart_test/velo-fo-depthInt_calving-von_mises_stress_damage-threshold_faceMelting
* step: full_run
* step: restart_run
test execution: SUCCESS
test validation: PASS
baseline comparison: FAIL
see: case_outputs/landice_humboldt_mesh-3km_restart_test_velo-fo-depthInt_calving-von_mises_stress_damage-threshold_faceMelting.log
test runtime: 00:20
Test Runtimes:
00:24 PASS landice_dome_2000m_sia_restart_test
00:06 PASS landice_dome_2000m_sia_decomposition_test
00:07 PASS landice_dome_variable_resolution_sia_restart_test
00:05 PASS landice_dome_variable_resolution_sia_decomposition_test
00:23 PASS landice_enthalpy_benchmark_A
00:16 PASS landice_eismint2_decomposition_test
00:18 PASS landice_eismint2_enthalpy_decomposition_test
00:21 PASS landice_eismint2_restart_test
00:17 PASS landice_eismint2_enthalpy_restart_test
00:10 FAIL landice_greenland_sia_restart_test
00:08 FAIL landice_greenland_sia_decomposition_test
00:41 FAIL landice_hydro_radial_restart_test
00:11 FAIL landice_hydro_radial_decomposition_test
00:18 FAIL landice_humboldt_mesh-3km_decomposition_test_velo-none_calving-none_subglacialhydro
00:48 FAIL landice_humboldt_mesh-3km_restart_test_velo-none_calving-none_subglacialhydro
00:11 FAIL landice_dome_2000m_fo_decomposition_test
00:08 FAIL landice_dome_2000m_fo_restart_test
00:11 FAIL landice_dome_variable_resolution_fo_decomposition_test
00:10 FAIL landice_dome_variable_resolution_fo_restart_test
00:20 FAIL landice_circular_shelf_decomposition_test
00:13 FAIL landice_greenland_fo_decomposition_test
00:12 FAIL landice_greenland_fo_restart_test
00:14 FAIL landice_thwaites_fo_decomposition_test
00:12 FAIL landice_thwaites_fo_restart_test
00:11 FAIL landice_thwaites_fo-depthInt_decomposition_test
00:13 FAIL landice_thwaites_fo-depthInt_restart_test
00:49 FAIL landice_humboldt_mesh-3km_restart_test_velo-fo_calving-von_mises_stress_damage-threshold_faceMelting
00:20 FAIL landice_humboldt_mesh-3km_restart_test_velo-fo-depthInt_calving-von_mises_stress_damage-threshold_faceMelting
Total runtime 08:18
FAIL: 19 tests failed, see above.
—
Reply to this email directly, view it on GitHub<#944 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABNKKZB5JRHIDWT3BG7OVZL4UFXITAVCNFSM6AAAAACWZ7OQGCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHM2DCOBXG4YTIMRYGI>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
|
@mperego , I recompiled with Do you have an idea of how to find the FPE to fix it? Or do you want me to try to track anything further down on my end? Assuming you're able to track it down, it sounds like we'll want to update spack to use that updated version of Albany in this PR (cc: @xylar ). |
This pull request updates to
mache.deploy, which uses the./deploy.pyscript instead of./conda/configure-compass-env.py.It switches to using
pixiin the background for creating environments with conda packages.Updates:
mache.deploy,mache.jigsawandmache.parallelas well as module updates on many machines and several bug fixescompass-2026-03-21compass-2026-02-06Testing
Only testing MALI, as MPAS-Ocean is no longer being tested regularly on Compass.
MALI with
full_integration:Deployed
MALI with
full_integration: